首页> 外文OA文献 >Single Document Automatic Text Summarization Using Term Frequency-Inverse Document Frequency (TF-IDF)
【2h】

Single Document Automatic Text Summarization Using Term Frequency-Inverse Document Frequency (TF-IDF)

机译:使用术语频率-反文档频率(TF-IDF)的单文档自动文本摘要

摘要

The increasing availability of online information has triggered an intensive research in the area of automatic text summarization within the Natural Language Processing (NLP). Text summarization reduces the text by removing the less useful information which helps the reader to find the required information quickly. There are many kinds of algorithms that can be used to summarize the text. One of them is TF-IDF (TermFrequency-Inverse Document Frequency). This research aimed to produce an automatic text summarizer implemented with TF-IDF algorithm and to compare it with other various online source of automatic text summarizer. To evaluate the summary produced from each summarizer, The F-Measure as the standard comparison value had been used. The result of this research produces 67% of accuracy with three data samples which are higher compared to the other online summarizers.
机译:在线信息的可用性不断提高,引发了对自然语言处理(NLP)中自动文本摘要领域的深入研究。文本摘要通过删除不太有用的信息来减少文本,这有助于读者快速找到所需的信息。有许多种算法可用于总结文本。其中之一是TF-IDF(TermFrequency-Inverse文档频率)。这项研究的目的是生产一种使用TF-IDF算法实现的自动文本摘要器,并将其与其他各种在线自动文本摘要器来源进行比较。为了评估每个汇总器生成的汇总,已使用F-Measure作为标准比较值。这项研究的结果使用三个数据样本产生了67%的准确度,这比其他在线汇总器要高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号